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HUMAN RPS6KA6-RELATED GENE VARIANT ASSOCIATED 

WITH LUNG CANCERS 

FIELD OF THE INVENTION 

[0001] The invention relates to the nucleic acid and polypeptide sequences 
5 of a novel human RPS6KA6-related gene variant, the preparation process 
thereof, and the uses of the same in diagnosing cancers, in particular, T cell 
lymphoblastic lymphoma. 

BACKGROUND OF THE INVENTION 

[0002] Lymphoma is the third most common cancer among children in the 
10 world. The major types of lymphoma are Hodgkin's and non-Hodgkin's. 
Non-Hodgkin's lymphoma (NHL) occurs more frequently than Hodgkin's 
disease among children. The major histopathological categories of NHL in 
children are (1) Burkitt's and Burkitt's like lymphomas; (2) lymphoblastic 
lymphomas; (3) anaplastic large cell lymphoma; and (4) diffuse large cell 
15 lymphomas (Percy et al., 1999). In recent years, much progress has been 
made toward understanding the molecular and cellular biology of NHL. 
Many important contributions have been made by the characterization of 
chromosomal translocations and the identification of several key genetic 
factors associated with each type of NHL (Percy et al., 1999). However, 
20 the treatments of NHL still mainly depend on chemotherapy and 
radiotherapy. This is because the molecular mechanisms underlying the 
pathogenesis of NHL remain largely unclear. 

[0003] Lymphoblastic lymphoma, a predominant T-cell tumor, accounts 
for about 30% of childhood NHL (National Cancer Institute Cancer.gov 
25 Web site, 2004). Recent studies have shown that T cell lymphoblastic 
lymphoma is caused by abnormal expression of several genetic factors such 
as BCL-6 (Hyjek et al., (2001) Blood. 97: 270-276), MSH2/Lmo-2/Tal-l 
(Lowsky et al., (1997) Blood. 89: 2276-2282) and Stat5 (Kelly et al., 
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(2003) J Exp Med. 198: 79-89). Stat5 has been shown to play a role in cell 
cycle regulation (Nieborowska-Skorska et al., (1999) J Exp Med. 189: 
1229-1242; Martino et al., (2001) J Immunol. 166: 1723-1729). Therefore, 
future strategies for the prevention and treatment of T cell lymphoblastic 
lymphoma will focus on the elucidation of genetic substrates associated 
with cell cycle regulation. Interestingly, three members (RSK1, RSK2, and 
RSK3) of the ribosomal S6 kinase (RSK) family have been shown to be 
involved in Jjie cell cycle regulation and may play a role in T cell 
(Edelmann et al., (1996) J Biol Chem. 271: 963-71; Zhao et al., (1996) J 
Biol Chem. 271: 29773-29779; Brennan et al., (1999) Mol Cell Biol. 
19:4729-38; Suzuki et al., (2001) J Immunol. 167:3064-73). Thus, it raised 
a possibility that RPS6KA6 (ribosomal S6 kinase 4; also named RSK4; 
GenBank accession # AF 184965) has a role in the development of T cell 
lymphoblastic lymphoma. Therefore, the discovery of gene variants of 
RPS6KA6 may be important targets for diagnostic markers of T cell 
lymphoblastic lymphoma. 

SUMMARY OF THE INVENTION 

[0004] The present invention provides an RPS6KA6-related gene variant 
(RPS6KA6V) which is negatively expressed in human T cell lymphoblastic 
lymphoma. The nucleotide sequence of the gene variant and the 
polypeptide sequence encoded thereby can be used for the diagnosis of any 
diseases associated with this gene variant or T cell lymphoblastic 
lymphoma. 

[0005] The invention further provides an expression vector and host cell 
for expressing the variant. 

[0006] The invention further provides a method for producing the variant. 

[0007] The invention further provides an antibody specifically binding to 
the variant. 
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[0008J The invention also provides methods for detecting the presence of 
the variant in a mammal. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0009] FIG. 1 shows the nucleic acid sequence (SEQ ID NO: 1) and amino 
acid sequence (SEQ ID NO: 2) of RPS6KA6V. 

[0010] FIG. 2 shows the nucleotide sequence alignment between the 
human RPS6KA6 gene (SEQ ID NO: 3) and its related gene variant 
(RPS6KA6V). 

[0011] FIG. 3 shows the amino acid sequence alignment between the 
human RPS6KA6 protein (SEQ ID NO: 4) and its related gene variant 
(RPS6KA6V). 

[0012] FIG. 4 shows the semi-quantitative RT-PCR analysis of 
RPS6KA6V in human cell lines, wherein the left and right columns are 100 
bp DNA markers. 

[0013] FIG. 5 shows the semi-quantitative RT-PCR analysis of 
RPS6KA6V in human tissue samples, wherein the left and right columns 
are 100 bp DNA markers. 

DETAILED DESCRIPTION OF THE INVENTION 

[0014] According to the present invention, all technical and scientific 
terms used have the same meanings as commonly understood by persons 
skilled in the art. 

[0015] The term ''antibody" used herein denotes intact molecules (a 
polypeptide or group of polypeptides) as well as fragments thereof, such as 
Fab, R(ab f )2, and Fv fragments, which are capable of binding the epitopic 
determinant. Antibodies are produced by specialized B cells after 
stimulation by an antigen. Structurally, antibody consists of four subunits 



C:\WINDOWS\TEMP\us6722-cip-final.DOC 



including two heavy chains and two light chains. The internal surface 
shape and charge distribution of the antibody binding domain is 
complementary to the features of an antigen. Thus, antibody can 
specifically act against the antigen in an immune response. 

[0016] The term "base pair (bp)" used herein denotes nucleotides 
composed of a purine on one strand of DNA which can be hydrogen 
bonded to a pyrimidine on the other strand. Thymine (or uracil) and 
adenine residues are linked by two hydrogen bonds. Cytosine and guanine 
residues are linked by three hydrogen bonds. 

[0017] The term "Basic Local Alignment Search Tool (BLAST; Altschul 
et al., (1997) Nucleic Acids Res. 25: 3389-3402)" used herein denotes 
programs for evaluation of homologies between a query sequence (amino 
or nucleic acid) and a test sequence as described by Altschul et al. (Nucleic 
Acids Res. 25: 3389-3402, 1997). Specific BLAST programs are described 
as follows: 

[0018] (1) BLASTN compares a nucleotide query sequence with a 
nucleotide sequence database; 

[0019] (2) BLASTP compares an amino acid query sequence with a 
protein sequence database; 

[0020] (3) BLASTX compares the six-frame conceptual translation 
products of a query nucleotide sequence with a protein sequence database; 

[0021] (4) TBLASTN compares a query protein sequence with a 
nucleotide sequence database translated in all six reading frames; and 

[0022] (5) TBLASTX compares the six-frame translations of a 
nucleotide query sequence with the six-frame translations of a nucleotide 
sequence database. 
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[0023] The term "cDNA" used herein denotes nucleic acids that 
synthesized from a mRNA template using reverse transcriptase. 

[0024] The term M cDNA library" used herein denotes a library composed 
of complementary DNAs, which are reverse-transcribed from mRNAs. 

[0025] The term "complement" used herein denotes a polynucleotide 
sequence capable of forming base pairing with another polynucleotide 
sequence. For example, the sequence 5 ' - ATGG ACTTACT-3 ' binds to the 
complementary sequence 5'- AGTAAGTCC AT-3 \ 

[0026] The term "deletion" used herein denotes a removal of a portion of 
one or more amino acid residues/nucleotides from a gene. 

[0027] The term "expressed sequence tags (ESTs)" used herein denotes 
short (200 to 500 base pairs) nucleotide sequence derived from either 5' or 
3' end ofacDNA. 

[0028] The term "expression vector" used herein denotes nucleic acid 
constructs which contain a cloning site for introducing the DNA into 
vector, one or more selectable markers for selecting vectors containing the 
DNA, an origin of replication for replicating the vector whenever the host 
cell divides, a terminator sequence, a polyadenylation signal, and a suitable 
control sequence which can effectively express the DNA in a suitable host. 
The suitable control sequence may include promoter, enhancer and other 
regulatory sequences necessary for directing polymerases to transcribe the 
DNA. 

[0029] The term "host cell" used herein denotes a cell, which is used to 
receive, maintain, and allow the reproduction of an expression vector 
comprising DNA. Host cells are transformed or transfected with suitable 
vectors constructed using recombinant DNA methods. The recombinant 
DNA introduced with the vector is replicated whenever the cell divides. 
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[0030] The term "insertion" or "addition" used herein denotes the addition 
of a portion of one or more amino acid residues/nucleotides to a gene. 

[0031] The term "in silico" used herein denotes a process of using 
computational methods (e.g., BLAST) to analyze DNA sequences. 

[0032] The term "polymerase chain reaction (PCR)" used herein denotes a 
method which increases the copy number of a nucleic acid sequence using 
a DNA polymerase and a set of primers (about 20bp oligonucleotides 
complementary to each strand of DNA) under suitable conditions 
(successive rounds of primer annealing, strand elongation, and 
dissociation). 

[0033] The term "protein" or "polypeptide" used herein denotes a 
sequence of amino acids in a specific order that can be encoded by a gene 
or by a recombinant DNA. It can also be chemically synthesized. 

[0034] The term "nucleic acid sequence" or "polynucleotide" used herein 
denotes a sequence of nucleotide (guanine, cytosine, thymine or adenine) in 
a specific order that can be a natural or synthesized fragment of DNA or 
RNA. It may be single-stranded or double-stranded. 

[0035] The term "reverse transcriptase-polymerase chain reaction (RT- 
PCR)" used herein denotes a process which transcribes mRNA to 
complementary DNA strand using reverse transcriptase followed by 
polymerase chain reaction to amplify the specific fragment of DNA 
sequences. 

[0036] The term "transformation" used herein denotes a process describing 
the uptake, incorporation, and expression of exogenous DNA by 
prokaryotic host cells. 
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[0037] The term "transfection" used herein denotes a process describing 
the uptake, incorporation, and expression of exogenous DNA by eukaryotic 
host cells. 

[0038] The term "variant" used herein denotes a fragment of sequence 
(nucleotide or amino acid) inserted or deleted by one or more 
nucleotides/amino acids. 

[0039] The present invention provides the polypeptides of a novel human 
RPS6KA6-related gene variant, as well as the nucleic acid sequences 
encoding the same. 

[0040] According to the present invention, human RPS6KA6 cDNA 
sequence was used to query the human lung EST databases (a normal lung, 
a large cell lung cancer, a squamous cell lung cancer and a small cell lung 
cancer) using BLAST program to search for RPS6KA6-related gene 
variants. Four ESTs showing similarity to RPS6KA6 were identified. Two 
were from the large cell lung cancer, one was from the squamous cell lung 
cancer and one was from the SCLC databases. Their corresponding cDNA 
clones were found to be identical after sequencing and named RPS6KA6V 
(RPS6KA6 variant). FIG. 1 shows the nucleic acid sequence of 
RPS6KA6V (SEQ ID NO: 1) and the amino acid sequence encoded thereby 
(SEQ ID NO: 2). 

[0041] The full-length of the RPS6KA6V cDNA is a 2403bp clone 
containing a 2094bp open reading frame (ORF) extending from 6bp to 
2099bp, which corresponds to an encoded protein of 698 amino acid 
residues with a predicted molecular mass of 78.2 kDa. To determine the 
variation in sequence of RPS6KA6V cDNA clone, an alignment of 
RPS6KA6 nucleotide/amino acid sequence with RPS6KA6V was 
performed (FIGs. 2 and 3). The results indicate that one major genetic 
deletion was found in the aligned sequences showing that RPS6KA6V is a 
141bp deletion in the sequence of RPS6KA6 from 1978-21 18bp. The lack 
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of 141 bp (corresponding to 47aa) is an in-frame deletion in the amino acid 
sequence of RPS6KA6 and generates a polypeptide of 698 amino acid 
residues of RPS6KA6V (Fig. 3). 

[0042] In the present invention, a search of ESTs deposited in dbEST 
5 (Boguski et al. (1993) Nat Genet. 4: 332-3) at NCBI was performed to 
determine the tissue distribution of RPS6KA6V in silico. The result of in 
silico Northern analysis showed that one EST (GenBank accession number 
AA626690) was found to confirm the absence of 141bp region on 
RPS6KA6V nucleotide sequence. This EST was also generated from a 
10 lung carcinoma cDNA library suggesting that the absence of 141bp 
nucleotide fragment located between 1977-1978bp of RPS6KA6V may 
serve as a useful marker for diagnosing cancers associated with this gene 
variant. Therefore, any nucleotide fragments comprising 1977-1978bp of 
RPS6KA6V may be used as probes for determining the presence of 
is RPS6KA6V under high stringency conditions. An alternative approach is 
that any set of primers for amplifying the fragment containing 1977-1978bp 
of RPS6KA6V may be used for determining the presence of the variant. 

[0043] According to the present invention, the polypeptides of the human 
RPS6KA6V may be produced through genetic engineering techniques. In 

20 this case, they are produced by appropriate host cells, which have been 
transformed by DNAs that code for the polypeptides. The nucleotide 
sequence encoding the polypeptide containing 657-65 8aa of the human 
RPS6KA6V is inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for the transcription and translation 

25 of the inserted coding sequence in a suitable host. The nucleic acid 
sequence is inserted into the vector in a manner that it will be expressed 
under appropriate conditions (e.g., in proper orientation and correct reading 
frame and with appropriate expression sequences, including an RNA 
polymerase binding sequence and a ribosomal binding sequence). 
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[0044] Any method that is known to those skilled in the art may be used to 
construct expression vectors containing sequences encoding the 
polypeptide of the human RPS6KA6V and appropriate 
transcriptional/translational control elements. These methods may include 
in vitro recombinant DNA and synthetic techniques, and in vivo genetic 
recombinants. (See, e.g., Sambrook, J. Cold Spring Harbor Press, 
Plainview N.Y., ch. 4, 8, and 16-17; Ausubel, R. M. et al. (1995) Current 
protocols in Molecular Biology, John Wiley & Sons, New York N.Y., ch. 
9, 13, and 16.) 

[0045] A variety of expression vector/host systems may be utilized to 
express the polypeptide-coding sequence. These include, but not limited 
to, microorganisms such as bacteria transformed with recombinant 
bacteriophage, plasmid, or cosmid DNA expression vector; yeast 
transformed with yeast expression vector; insect cell systems infected with 
virus (e.g., baculovirus); plant cell system transformed with viral 
expression vector (e.g., cauliflower mosaic virus, CaMV, or tobacco 
mosaic virus, TMV); or animal cell system infected with virus (e.g., 
vaccina virus, adenovirus, etc.). Preferably, the host cell is a bacterium, 
and most preferably, the bacterium is E. coli. 

[0046] Alternatively, the polypeptide of the human RPS6KA6V or the 
fragments thereof may be synthesized by using chemical methods. For 
example, peptide synthesis can be performed using various solid-phase 
techniques (Roberge, J. Y. et al. (1995) Science 269: 202 to 204). 
Automated synthesis may be achieved by using the ABI 431 A peptide 
synthesizer (Perkin-Elmer). 

[0047] According to the present invention, the polypeptide and nucleic 
acid sequence of the human RPS6KA6V can be used as immunogen and 
template of primers/or probes, respectively. 



C:\WJN DOWS\TEMPWs6722-cip.flnal.DOC 



- 10- 



[0048] The present invention further provides the antibodies which 
specifically bind one or more out-surface epitopes of the polypeptides of 
the human RPS6KA6V. 

[0049] According to the present invention, immunization of mammals 
with immunogens described herein, preferably humans, rabbits, rats, mice, 
sheep, goats, cows, or horses, is performed following procedures well 
known to those skilled in the art, for the purpose of obtaining antisera 
containing polyclonal antibodies or hybridoma lines secreting monoclonal 
antibodies. 

[0050] Monoclonal antibodies can be prepared by standard techniques, 
given the teachings contained herein. Such techniques are disclosed, for 
example, in U.S. Patent Nos. 4,271,145 and 4,196,265. Briefly, an animal 
is immunized with the immunogen. Hybridomas are prepared by fusing 
spleen cells from the immunized animal with myeloma cells. The fusion 
products are screened for those producing antibodies that bind to the 
immunogen. The positive hybridoma clones are isolated, and the 
monoclonal antibodies are recovered from those clones. 

[0051] Immunization regimens for production of both polyclonal and 
monoclonal antibodies are well-known in the art. The immunogen may be 
injected by any of a number of routes, including subcutaneous, intravenous, 
intraperitoneal, intradermal, intramuscular, mucosal, or a combination 
thereof. The immunogen may be injected in soluble form, aggregate form, 
attached to a physical carrier, or mixed with an adjuvant, using methods 
and materials well-known in the art. The antisera and antibodies may be 
purified using column chromatography methods well known to those 
skilled in the art. 

[0052] According to the present invention, antibody fragments which 
contain specific binding sites for the polypeptides may also be generated. 
For example, such fragments include, but are not limited to, F(ab , )2 
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fragments produced by pepsin digestion of the antibody molecule and Fab 
fragments generated by reducing the disulfide bridges of the F(ab , )2 
fragments. 

[0053] The subject invention also provides methods for diagnosing the 
diseases associated with the human RPS6KA6V or T cell lymphoblastic 
lymphoma, by the utilization of the nucleic acid sequence, the polypeptide 
of the human RPS6KA6V, and the antibodies against the polypeptide. 

[0054] Many gene variants have been found to be associated with diseases 
(Stallings-Mann et al., (1996) Proc Natl Acad Sci U S A 93: 12394-9; Liu 
et al., (1997) Nat Genet 16:328-9; Siffert et al., (1998) Nat Genet 18: 45 to 
8; Lukas et al., (2001) Cancer Res 61: 3212 to 9). Since RPS6KA6V clone 
was isolated from lung cancers cDNA libraries and identified its expression 
in lung carcinoma cDNA library using in silico Northern analysis, it is 
advisable that RPS6KA6V may serve as a marker for the diagnosis of 
human cancers. Thus, the expression level of RPS6KA6V relative to 
RPS6KA6 may be a useful indicator for screening of patients suspected of 
having cancers. This suggests that the index of relative expression level 
(mRNA or protein) may confer an increased susceptibility to cancers. 
Fragments of RPS6KA6 mRNA may be detected by RT-PCR approach. 
Polypeptides of RPS6KA6V may be determined by the binding of 
antibodies to these polypeptides. These approaches may be performed in 
accordance with conventional methods well known to persons skilled in the 
art. 

[0055] According to the present invention, the expression of the gene 
variant mRNA in sample may be determined by, but not limited to, RT- 
PCR. Using TRIZOL reagents (Life Technology), total RNA may be 
isolated from patient samples. Tissue samples (e.g., biopsy samples) are 
powdered under liquid nitrogen before homogenization. RNA purity and 
integrity are assessed by absorbance at 260/280 nm and by agarose gel 
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electrophoresis. A set of primers can be designed to amplify the expected 
size of specific PCR fragments of RPS6KA6V. For example, one of the 
primers may be designed to have a sequence comprising the nucleotides of 
SEQ ID NO: 1 containing nucleotides 1974 to 1979, and the other may be 
designed to have a sequence complementary to the nucleotides of SEQ ID 
NO: 1 at any other locations. Alternatively, one of the primers may be 
designed to have a sequence complementary to the nucleotides of SEQ ID 
NO: 1 upstream of nucleotide 1977 and the other may be designed to have 
a sequence comprising the nucleotides of SEQ ID NO: 1 downstream of 
nucleotide 1978. In this case, both RPS6KA6 and RPS6KA6V will be 
amplified. The length of the PCR fragment from RPS6KA6V will be 
141 bp shorter than that from RPS6KA6. PCR fragments are analyzed on a 
1% agarose gel using five microliters (10%) of the amplified products. The 
intensity of the signals may be determined by using the Molecular Analyst 
program (version 1.4.1; Bio-Rad). Thus, the index of relative expression 
levels for each co-amplified PCR product may be calculated based on the 
intensity of signals. 

[0056] The RT-PCR experiment may be performed according to the 
manufacturers instructions (Boehringer Mannheim). A 50jal reaction 
mixture containing 2jil total RNA (O.l^g/pil), each primer (20 pM), 1 jal 
each dNTP (10 mM), 2.5 ^il DTT solution (100 mM), 10 jal 5X RT-PCR 
buffer, 1 |al enzyme mixture, and 28.5 (il sterile distilled water may be 
subjected to the conditions such as reverse transcription at 60°C for 30 
minutes followed by 35 cycles of denaturation at 94°C for 2 minutes, 
annealing at 60°C for 2 minutes, and extension at 68°C for 2 minutes. The 
RT-PCR analysis may be repeated twice to ensure reproducibility, for a 
total of three independent experiments. 

[0057] The expression of the gene variant can also be analyzed using 
Northern Blot hybridization approach. Specific fragment of the 
RPS6KA6V may be amplified by polymerase chain reaction (PCR) using 
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primer set designed for RT-PCR. The amplified PCR fragment may be 
labeled and serve as a probe to hybridize the membranes containing total 
RNAs extracted from the samples under the conditions of 55°C in a 
suitable hybridization solution for 3 hr. Blots may be washed twice in 2 x 
SSC, 0.1% SDS at room temperature for 15 minutes each, followed by two 
washes in 0.1 x SSC and 0.1% SDS at 65°C for 20 minutes each. After 
these washes, blot may be rinsed briefly in suitable washing buffer and 
incubated in blocking solution for 30 minutes, and then incubated in 
suitable antibody solution for 30 minutes. Blots may be washed in washing 
buffer for 30 minutes and equilibrated in suitable detection buffer before 
detecting the signals. Alternatively, the presence of gene variant (cDNAs 
or PCR) can be detected using microarray approach. The cDNAs or PCR 
products corresponding to the nucleotide sequences of the present invention 
may be immobilized on a suitable substrate such as a glass slide. 
Hybridization can be preformed using the labeled mRNAs extracted from 
samples. After hybridization, nonhybridized mRNAs are removed. The 
relative abundance of each labeled transcript, hybridizing to a cDNA/PCR 
product immobilized on the microarray, can be determined by analyzing 
the scanned images. 

[0058] According to the present invention, the presence of the polypeptide 
of the gene variant in samples may be determined by, but not limited to, the 
immunoassay, which uses the antibody specifically binding to the 
polypeptide. For instance, the polypeptide in protein samples obtained 
from the mammal suspected of having such diseases may be determined by, 
but not limited to, the immunoassay wherein the antibody specifically 
binding to the polypeptide of the invention is brought into contact with the 
protein samples, and the antibody-polypeptide complex is detected. If 
necessary, the amount of antibody-polypeptide complex can be determined. 

[0059] The polypeptides of the human RPS6KA6V may be expressed in 
prokaryotic cells by using suitable prokaryotic expression vectors. The 
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cDNA fragments of RPS6KA6V gene encoding the amino acid coding 
sequence may be PCR amplified using primer set with restriction enzyme 
digestion sites incorporated in the 5' and 3' ends, respectively. The PCR 
products can then be enzyme digested, purified, and inserted into the 
corresponding sites of prokaryotic expression vector in-frame to generate 
recombinant plasmids. Sequence fidelity of this recombinant DNA can be 
verified by sequencing. The prokaryotic recombinant plasmids may be 
transformed into host cells (e.g., E. coli BL21 (DE3)). Recombinant 
protein synthesis may be stimulated by the addition of 0.4 mM 
isopropylthiogalactoside (IPTG) for 3h. The bacterially-expressed proteins 
may be purified. 

[0060] The polypeptide of the gene variant may be expressed in animal 
cells by using eukaryotic expression vectors. Cells may be maintained in 
Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% 
fetal bovine serum (FBS; Gibco BRL) at 37°C in a humidified 5% CO2 
atmosphere. Before transfection, the nucleotide sequence of each of the 
gene variant may be amplified with PCR primers containing restriction 
enzyme digestion sites and ligated into the corresponding sites of 
eukaryotic expression vector in-frame. Sequence fidelity of this 
recombinant DNA can be verified by sequencing. The cells may be plated 
in 12-well plates one day before transfection at a density of 5 x 10 4 cells 
per well. Transfections may be carried out using Lipofectamine Plus 
transfection reagent according to the manufacturer's instructions (Gibco 
BRL). Three hours following transfection, medium containing the 
complexes may be replaced with fresh medium. Forty-eight hours after 
incubation, the cells may be scraped into lysis buffer (0.1 M Tris HC1, pH 
8.0, 0.1% Triton X-100) for purification of expressed proteins. After these 
proteins are purified, monoclonal antibodies against these purified proteins 
(RPS6KA6V) may be generated using hybridoma technique according to 
the conventional methods (de StGroth and Scheidegger, (1980) J Immunol 
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Methods 35:1-21; Cote et al. (1983) Proc Natl Acad Sci U S A 80: 2026- 
30; and Kozbor et al. (1985) J Immunol Methods 81:31-42). 

[0061] According to the present invention, the presence of the 
polypeptides of the gene variant in samples may be determined by, but not 
limited to, Western blot analysis. Proteins extracted from samples may be 
separated by SDS-PAGE and transferred to suitable membranes such as 
polyvinylidene difluoride (PVDF) in transfer buffer (25 mM Tris-HCl, pH 
8.3, 192 mM glycine, 20% methanol) with a Trans-Blot apparatus for lh at 
100 V (e.g., Bio-Rad). The proteins can be immunoblotted with specific 
antibodies. For example, membrane blotted with extracted proteins may be 
blocked with suitable buffers such as 3% solution of BSA or 3% solution of 
nonfat milk powder in TBST buffer (10 mM Tris-HCl, pH 8.0, 150 mM 
NaCl, 0.1% Tween 20) and incubated with monoclonal antibody directed 
against the polypeptides of gene variants. Unbound antibody is removed 
by washing with TBST for 5 X 1 minutes. Bound antibody may be 
detected using commercial ECL Western blotting detecting reagents. 

[0062] The following examples are provided for illustration, but not for 
limiting the invention. 

EXAMPLES 

Analysis of Human Lung EST Databases 

[0063] Expressed sequence tags (ESTs) generated from the large-scale 
PCR-based sequencing of the 5'-end of human lung (normal, SCLC, 
squamous cell lung cancer and large cell lung cancer) cDNA clones were 
compiled and served as EST databases. Sequence comparisons against the 
nonredundant nucleotide and protein databases were performed using 
BLASTN and BLASTX programs (Altschul et al., (1997) Nucleic Acids 
Res. 25: 3389-3402; Gish and States, (1993) Nat Genet 3:266-272), at the 
National Center for Biotechnology Information (NCBI) with a significant 
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cutoff of p<10'10. ESTs representing putative RPS6KA6V gene were 
identified during the course of EST generation. 

Isolation of cDNA Clones 

[0064] Four identical cDNA clones exhibiting EST sequences similar to 
the RPS6KA6 gene were isolated from lung cancer cDNA libraries and 
named RPS6KA6V. The inserts of these clones were subsequently excised 
in vivo from the AZAP Express vector using the ExAssist/XLOLR helper 
phage system (Stratagene). Phagemid particles were excised by coinfecting 
XL 1 -BLUE MRF' cells with ExAssist helper phage. The excised 
pBluescript phagemids were used to infect E. coli XLOLR cells, which 
lack the amber suppressor necessary for ExAssist phage replication. 
Infected XLOLR cells were selected using kanamycin resistance. Resultant 
colonies contained the double stranded phagemid vector with the cloned 
cDNA insert. A single colony was grown overnight in LB-kanamycin, and 
DNA was purified using a Qiagen plasmid purification kit. 

Full Length Nucleotide Sequencing and Database Comparisons 

[0065] Phagemid DNA was sequenced using the Epicentre#SE9101LC 
SequiTherm EXCELTMji DN a Sequencing Kit for 4200S-2 Global NEW 
IR 2 DNA sequencing system (LI-COR). Using the primer- walking 
approach, full-length sequence was determined. Nucleotide and protein 
searches were performed using BLAST against the non-redundant database 
ofNCBI. 

In Silico Tissue Distribution (Northern) Analysis 

[0066] The coding sequence for each cDNA clones was searched against 
the dbEST sequence database (Boguski et al., (1993) Nat Genet. 4: 332-3) 
using the BLAST algorithm at the NCBI website. ESTs derived from each 
tissue were used as a source of information for transcript tissue expression 
analysis. Tissue distribution for each isolated cDNA clone was determined 
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by ESTs matching to that particular sequence variants (insertions or 
deletions) with a significance cutoff of p<10~10. 

Reverse Transcriptase-Polymerase Chain Reaction (RT-PCR) Analysis 

[0067] Total RNA was extracted from WI-38 (fibroblast), A549 (lung 
adenocarcinoma), NCI-H661 (lung large cell carcinoma), NCI-H520 (lung 
squamous carcinoma), NCI-H209 (lung small cell carcinoma), JHH-4 
(hepatoma), SUP-T1 (T-cell lymphoblastic lymphoma), Daudi (Burkitt's 
lymphoma), Ramos (Burkitt's lymphoma), and Raji (Burkitt's lymphoma) 
cell lines and from four breast cancer, two gastric ulcer, two colon cancer, 
two hepatoma, one Grave's disease, one colon cancer, one pancreatic 
carcinoma, one left neck tumor, one gastric carcinoma, two thyroid tumor, 
one spleen, one pancreatic abscess, one Gastric carcinoma, one 
adenomatous polyposis, one right neck lymph tissue, one liver cirrhosis, 
and two parotid gland mixed tumor biopsied samples, RNA purity and 
integrity were assessed by the absorbance at 260/280 nm and by agarose 
gel electrophoresis. 

[0068] The forward and reverse primers for RPS6KA6V were 5'- 
GGAGCAAAGGGAGC AATGGTTG-3 ' (SEQ ID NO: 5) and 5'- 
TCTTC ATCC AGTTTGGCCTAGG-3 5 (SEQ ID NO: 6), respectively. The 
expected size of the specific PCR fragment was 170 bp. Glyceraldehyde-3- 
phosphate dehydrogenase (GAPDH; accession No. M33197) was used as 
an internal control. The forward and reverse primers for GAPDH were 5'- 
TGGGTGTGA ACC ATGAG AAG-3 ' (SEQ ID NO: 7) and 5'- 
GTGTCGCTGTTGAAGTCAGA-3 ' (SEQ ID NO: 8), respectively. The 
expected size of the PCR fragment was 472 bp. The electrophoresis results 
of the RPS6KA6V mRNA expression patterns in 10 cell lines and 25 
biopsied samples determined by RT-PCR are shown in Figs. 4 and 5. The 
results showed that RPS6KA6V mRNA was consistently expressed in all 
cell lines and tissues investigated except in the T-cell lymphoblastic 
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lymphoma cell line. This suggests that RPS6KA6V can be used for 
diagnosing T-cell lymphoblastic lymphoma when RPS6KA6V mRNA 
cannot be detected. 
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SEQUENCE LISTING 

<110> Dai, Ken-Shwo 

<120> HUMAN RPS 6KA 6 -RELATED GENE VARIANT ASSOCIATED WITH LUNG CANCERS 

<130> U 015086-3 

<140> — 

<141> 2004-03-23 

<160> 8 

<170> Patentln version 3.1 

<210> 1 

<211> 2403 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (6) . . (2099) 

<223> 

<400> 1 

gggaa atg eta cca ttc get cct cag gac gag ccc tgg gac cga gaa a;tg 50 

Met Leu Pro Phe Ala Pro Gin Asp Glu Pro Trp Asp Arg Glu Met 
15 10 " 15 

gaa gtg ttc age ggc ggc ggc gcg age age ggc gag gta aat ggt ctt 98 
Glu Val Phe Ser Gly Gly Gly Ala Ser Ser Gly Glu Val Asn Gly Leu 
20 25 30 

aaa atg gtt gat gag cca atg gaa gag gga gaa gca gat tct tgt cat 146 
Lys Met Val Asp Glu Pro Met Glu Glu Gly Glu Ala Asp Ser Cys His 
35 % s 4 0 4 5 

gat gaa gga gtt gtt aaa gaa ate cct att act cat cat gtt aag gaa 194 
Asp Glu Gly Val Val Lys Glu He Pro He Thr His His Val Lys Glu 
50 55 60 

ggc tat gag aaa gca gat cct gca cag ttt gag ttg etc aag gtt ctt 242 
Gly Tyr Glu Lys Ala Asp Pro Ala Gin Phe Glu Leu Leu Lys Val Leu 
65 70 75 



ggt cag ggg 
Gly Gin Gly 
80 

cct gat get 
Pro Asp Ala 



tta aaa gtt 
Leu Lys Val 



gtg gaa gta 
Val Glu Val 
130 



tea ttt' gga 
Ser Phe Gly 
85 

ggg cag etc 
Gly Gin Leu 
100 

cga gac aga 
Arg Asp Arg 
115 

aat cat cca 
Asn His Pro 



aag gtt ttt 
Lys Val Phe 



tat gca atg 
Tyr Ala Met 



gtt egg aca 
Val Arg Thr 
120 

ttt att gtc 
Phe He Val 
135 



ctt gtt aga 
Leu Val Arg 
90 

aag gtg tta 
Lys Val Leu 
105 

aag atg gag 
Lys Met Glu 



aaa ttg cac 
Lys Leu His 



aag aag ace 
Lys Lys Thr 



aaa aaa gee 
Lys Lys Ala 
110 

agg gat ata 
Arg Asp He 
125 

tat gee ttt 
Tyr Ala Phe 
140 



ggt 290 

Gly 

95 

tct 338 
Ser 



ctg 386 
Leu 



cag 434 
Gin 
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act gaa ggg 
Thr Glu Gly 
145 

gtt ttc aca 
Val Phe Thr 
160 

aaa ttc tac 
Lys Phe Tyr 



tta gga att 
Leu Gly lie 



gaa ata gga 
Glu lie Gly 
210 

gta gat caa 
Val Asp Gin 
225 

atg get cct 
Met Ala Pro 
240 

tgg tgg tea 
Trp Trp Ser 



cca ttt caa 
Pro Phe Gin 



gca aaa ctt 
Ala Lys Leu 
290 

eta agg atg 
Leu Arg Met 
305 

gga gtt gaa 
Gly Val Glu 
320 

gat aaa tta 
Asp Lys Leu 



gga aaa cca 
Gly Lys Pro 



aca cct aaa 
Thr Pro Lys 
370 



aaa ctg tac 
Lys Leu Tyr 



aga tta tec 
Arg Leu Ser 
165 

etc gca gaa 
Leu Ala Glu 
180 

gtt tat aga 
Val Tyr Arg 
195 

cat ate aaa 
His lie Lys 



gaa aag aag 
Glu Lys Lys 



gaa gta gta 
Glu Val Val 
245 

tat ggt gtt 
Tyr Gly Val 
260 

ggt aaa gac 
Gly Lys Asp 
275 

gga atg cct 
Gly Met Pro 



tta ttc aaa 
Leu Phe Lys 



gaa ate aaa 
Glu lie Lys 
325 

tat aaa aga 
Tyr Lys Arg 
340 

gat gat act 
Asp Asp Thr 
355 

gat tct ccc 
Asp Ser Pro 



tta ata ctg 
Leu lie Leu 
150 

aaa gag gtt 
Lys Glu Val 



ctg gee ctt 
Leu Ala Leu 



gac ctg aag 
Asp Leu Lys 
200 

tta aca gat 
Leu Thr Asp 
215 

get tac tea 
Ala Tyr Ser 
230 

aat agg aga 
Asn Arg Arg 



ctt atg ttt 
Leu Met Phe 



aga aat gag 
Arg Asn Glu 
280 

caa ttt ctt 
Gin Phe Leu 
295 

agg aat cca 
Arg Asn Pro 
310 

aga cat ctg 
Arg His Leu 



gaa gtt caa 
Glu Val Gin 



ttt tgt ttt 
Phe Cys Phe 
360 

ggt ttg cca 
Gly Leu Pro 
375 



gat ttt etc 
Asp Phe Leu 
155 

ctg ttt aca 
Leu Phe Thr 
170 

get ttg gat 
Ala Leu Asp 
185 

cca gaa aac 
Pro Glu Asn 



ttt gga etc 
Phe Gly Leu 



ttt tgt ggt 
Phe Cys Gly 
235 

ggc cat tec 
Gly His Ser 
250 

gaa atg ctt 
Glu Met Leu 
265 

acc atg aat 
Thr Met Asn 



agt get gaa 
Ser Ala Glu 



gca aat aga 
Ala Asn Arg 
315 

ttt ttt gca 
Phe Phe Ala 
330 

cct cct ttc 
Pro Pro Phe 
345 

gat cct gaa 
Asp Pro Glu 



gec agt gca 
Ala Ser Ala 



agg; gga gga 
Arg Gly Gly 



gag gaa gat 
Glu Glu Asp 



cat ctg cac 
His Leu His 
190 

att ttg ctt 
lie Leu Leu 
205 

age aag gag 
Ser Lys Glu 
220 

aca gta gag 
Thr Val Glu 



cag agt get 
Gin Ser Ala 



act ggt act 
Thr Gly Thr 
270 

atg ata tta 
Met lie Leu 
285 

gca caa agt 
Ala Gin Ser 
300 

ttg gga tea 
Leu Gly Ser 



aat att gac 
Asn lie Asp 



aaa cct get 
Lys Pro Ala 
350 

ttt act gca 
Phe Thr Ala 
365 

aat get cat 
Asn Ala His 
380 



gat 482 
Asp 



gtg 530 

Val 

175 

caa 578 
Gin 



gat 626 
Asp 



tea 674 
Ser 



tat 722 
Tyr 



gat 770 

Asp 

255 

ctg 818 
Leu 



aaa 866 
Lys 



ctt 914 
Leu 



gaa 962 
Glu 



tgg 1010 
Trp 

335 

tct 1058 
Ser 



aaa 1106 
Lys 



cag 1154 
Gin 
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ctc ttc aaa 
Leu Phe Lys 

385 

aaa ate act 
Lys lie Thr 
400 

aat gga aat 
Asn Gly Asn 



att ggt gtt 
He Gly Val 



acc aac atg 
Thr Asn Met 
450 

cct tea gaa 
Pro Ser Glu 
465 

att att act 
He He Thr 
480 

gtt acg gat 
Val Thr Asp 



caa aaa tgt 
Gin Lys Cys 



agt aag aca 
Ser Lys Thr 
530 

ctt aaa cct 
Leu Lys Pro 
545 

tea ate agg 
Ser He Arg 
560 

aat gga ctt 
Asn Gly Leu 



gag gtt ctt 
Glu Val Leu 



tta gga gtc 
-Leu Gly Val 
610 

aat ggc ccc 



gga ttc age 
Gly Phe Ser 



cct ate aca 
Pro He Thr 
405 

get gca caa 
Ala Ala Gin 
420 

ggc tec tac 
Gly Ser Tyr 
435 

gaa ttt gca 
Glu Phe Ala 



gag att gaa 
Glu He Glu 



ttg aag gat 
Leu Lys Asp 
485 

tta atg aaa 
Leu Met Lys 
500 

ttc teg gaa 
Phe Ser Glu 
515 

gtt gac tat 
Val Asp Tyr 



agt aat att 
Ser Asn He 



ata tgt gat 
lie Cys Asp 
565 

etc tta act 
Leu Leu Thr 
580 

atg caa cag 
Met Gin Gin 
595 

ctt ttt tac 
Leu Phe Tyr 



aat gat act 



ttt gtt gca 
Phe Val Ala 
390 

agt gca aat 
Ser Ala Asn 



ttt ggt gaa 
Phe Gly Glu 



tct gtt tgc 
Ser Val Cys 
440 

gtg aag ate 
Val Lys He 
455 

ata ttg atg 
He Leu Met 
470 

gtc ttt gat 
Val Phe Asp 



gga gga gag 
Gly Gly Glu 



egg gag get 
Arg Glu Ala 
520 

ctt cat tgt 
Leu His Cys 
535 

tta tac atg 
Leu Tyr Met 
550 

ttt ggg ttt 
Phe Gly Phe 



cca tgc tac 
Pro Cys Tyr 



gga tat gat 
Gly Tyr Asp 
600 

aca atg ttg 
Thr Met Leu 
615 

cct gaa gag 



act tct att 
Thr Ser He 
395 

gta tta cca 
Val Leu Pro 
410 

gta tat gaa 
Val Tyr Glu 
425 

aag cga tgc 
Lys Arg Cys 



att gac aaa 
He Asp Lys 



cgc tat gga 
Arg Tyr Gly 
475 

gat ggt aga 
Asp Gly Arg 
490 

tta ctt gac 
Leu Leu Asp 
505 

agt gat ata 
Ser Asp lie 



caa gga gtt 
Gin Gly Val 



gat gaa tea 
Asp Glu Ser 
555 

gca aaa caa 
Ala Lys Gin 
570 

act gca aac 
Thr Ala Asn 
585 

get get tgt 
Ala Ala Cys 



get ggc tac 
Ala Gly Tyr 



ata ctg ctg 



gca gaa gaa 
Ala Glu Glu 



att gtt cag 
lie Val Gin 



ttg aag gag 
Leu Lys Glu 
430 

ata cat gca 
He His Ala 
445 

agt aag cga 
Ser Lys Arg 
460 

caa cat ccc 
Gin His Pro 



tat gtt tac 
Tyr Val Tyr 



cgt att etc 
Arg He Leu 
510 

eta tat gta 
Leu Tyr Val 
525 

gtt cat cgt 
Val His Arg 
540 

gee agt gca 
Ala Ser Ala 



ctt cga gga 
Leu Arg Gly 



ttt gtt gca 
Phe Val Ala 
590 

gat ate tgg 
Asp He Trp 
605 

act cca ttt 
Thr Pro Phe 
620 

cgt ata ggc 



tat 1202 
Tyr 



ata 1250 

lie. 

415 

gat 1298 
Asp 



act 1346 
Thr 



gac 1394 
Asp 



aac 1442 
Asn 



ctt 1490 

Leu 

495 

aaa « 1538 
Lys 



ata 1586 
lie 



gat 1634 
Asp 



gat 1682 
Asp 



gaa 1730 

Glu 

575 

cct 1778 
Pro 



agt 1826 
Ser . 



get 1874 
Ala 



aat 1922 
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Asn Gly Pro Asn Asp Thr Pro Glu Glu lie Leu Leu Arg lie Gly Asn 
625 630 635 

gga aaa ttc tct ttg agt ggt gga aac tgg gac aat att tea gac gga 1970 
Gly Lys Phe Ser Leu Ser Gly Gly Asn Trp Asp Asn lie Ser Asp Gly 
640 645 650 655 

gca aag gga gca atg gtt gca aca tac tct gec ctg act cac aag acc 2018 
Ala Lys Gly Ala Met Val Ala Thr Tyr Ser Ala Leu Thr His Lys Thr 
660 665 670 

ttt caa cca gtc eta gag cct gta get get tea age tta gec cag cga 2066 
Phe Gin Pro Val Leu Glu Pro Val Ala Ala Ser Ser Leu Ala Gin Arg 
675 680 685 

egg age atg aaa aag cga aca tea act ggc ctg taagatttgt ggtgttccta 2119 
Arg Ser Met Lys Lys Arg Thr Ser Thr Gly Leu 
690 695 

ggccaaactg gatgaagatg aaattaaatg tgtggctttt ttcctattct tatcaaaggc 2179 

atcgttgtct gctaaattac ttgaatatta agtaatatta aatccccatt tttaggggaa 2239 

gtgagattta aaaaaccatt cacaggtcca caatattcat actatgtgtt tgcagtagtg 2299 

ttcaagtgtt tatttaagca tataattggt gtccaccagg tcctcacaac ttctctgcac 2359 

acaagcttct aaaattcctt tcaaataaag ttactttaat attt 2403 

<210> 2 

<211> 698 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Leu Pro Phe Ala Pro Gin Asp Glu Pro Trp Asp Arg Glu Met Glu 
1 5 10 15 

Val Phe Ser Gly Gly Gly Ala Ser Ser Gly Glu Val Asn Gly Leu Lys 
20 25 30 

Met Val Asp Glu Pro Met Glu Glu Gly Glu Ala Asp Ser Cys His Asp 
35 40 45 

Glu Gly Val Val Lys Glu lie Pro He Thr His His Val Lys Glu Gly 
50 55 60 

Tyr Glu Lys Ala Asp Pro Ala Gin Phe Glu Leu Leu Lys Val Leu Gly 
65 70 75 80 

Gin Gly Ser Phe Gly Lys Val Phe Leu Val Arg Lys Lys Thr Gly Pro. 

85 90 95 



4 
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Asp Ala Gly Gin Leu Tyr Ala Met Lys Val Leu Lys Lys Ala Ser Leu 
100 105 110 



Lys Val Arg Asp Arg Val Arg Thr Lys Met Glu Arg Asp lie Leu Val 
115 120 125 



Glu Val Asn His Pro Phe lie Val Lys Leu His Tyr Ala Phe Gin Thr 
130 135 140 



Glu Gly Lys Leu Tyr Leu lie Leu Asp Phe Leu Arg Gly Gly Asp Val 
145 150 155 160 



Phe Thr Arg Leu Ser Lys Glu Val Leu Phe Thr Glu Glu Asp Val Lys 
165 170 175 



Phe Tyr Leu Ala Glu Leu Ala Leu Ala Leu Asp His Leu His Gin Leu 
180 185 190 



Gly lie Val Tyr Arg Asp Leu Lys Pro Glu Asn lie Leu Leu Asp Glu 
195 * 200 205 



lie Gly His lie Lys Leu Thr Asp Phe Gly Leu Ser Lys Glu Ser Val 
210 215 * ~ 220 



Asp Gin Glu Lys Lys Ala Tyr Ser Phe Cys Gly Thr Val Glu Tyr Met 
225 230 235 240 



Ala Pro Glu Val Val Asn Arg Arg Gly His Ser Gin Ser Ala Asp Trp 
245 250 255 



Trp Ser Tyr Gly Val Leu Met Phe Glu Met Leu Thr Gly Thr Leu Pro 
260 265 270 



Phe Gin Gly Lys Asp Arg Asn Glu Thr Met Asn Met lie Leu Lys Ala 
275 280 285 



Lys Leu Gly Met Pro Gin Phe Leu Ser Ala Glu Ala Gin Ser Leu Leu 
290 295 300 



Arg Met Leu Phe Lys Arg Asn Pro Ala Asn Arg Leu Gly Ser Glu Gly 
305 310 315 320 



Val Glu Glu lie Lys Arg His Leu Phe Phe Ala Asn lie Asp Trp Asp 
325 330 * 335 



Lys Leu Tyr Lys Arg Glu Val Gin Pro Pro Phe Lys Pro Ala Ser Gly 
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340 345 350 



Lys Pro Asp Asp Thr Phe Cys Phe Asp Pro Glu Phe Thr Ala Lys Thr 
355 360 365 



Pro Lys Asp Ser Pro Gly Leu Pro Ala Ser Ala Asn Ala His Gin Leu 
370 375 380 



Phe Lys Gly Phe Ser Phe Val Ala Thr Ser lie Ala Glu Glu Tyr Lys 
385 390 395 400 



lie Thr Pro lie Thr Ser Ala Asn Val Leu Pro lie Val Gin lie Asn 
405 410 415 



Gly Asn Ala Ala Gin Phe Gly Glu Val Tyr Glu Leu Lys Glu Asp lie 
420 425 430 



Gly Val Gly Ser Tyr Ser Val Cys Lys Arg Cys lie His Ala Thr Thr 
435 440 " " 445 



Asn Met Glu Phe Ala Val Lys lie lie Asp Lys Ser Lys Arg Asp Pro 
450 455 460 



Ser Glu Glu lie Glu lie Leu Met Arg Tyr Gly Gin His Pro Asn lie 
465 470 475 480 



lie Thr Leu Lys Asp Val Phe Asp Asp Gly Arg Tyr Val Tyr Leu Val 
485 490 495 



Thr Asp Leu Met Lys Gly Gly Glu Leu Leu Asp Arg lie Leu Lys Gin 
500 "* " 505 510 



Lys Cys Phe Ser Glu Arg Glu Ala Ser Asp lie Leu Tyr Val lie Ser 
515 520 525 



Lys Thr Val Asp Tyr Leu His Cys Gin Gly Val Val His Arg Asp Leu 
530 ~ 535 540 



Lys Pro Ser Asn lie Leu Tyr Met Asp Glu Ser Ala Ser Ala Asp Ser 
545 550 ~ 555 560 1 



lie Arg lie Cys Asp Phe Gly Phe Ala Lys Gin Leu Arg Gly Glu Asn 
565 " 570 " 575 



Gly Leu Leu Leu Thr Pro Cys Tyr Thr Ala Asn Phe Val Ala Pro Glu 
580 585 590 
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Val Leu Met Gin Gin Gly Tyr Asp Ala Ala Cys Asp lie Trp Ser Leu 
595 600 605 



Gly Val Leu Phe Tyr Thr Met Leu Ala .Gly Tyr Thr Pro Phe Ala Asn 
610 615 620 



Gly Pro Asn Asp Thr Pro Glu Glu lie Leu Leu Arg lie Gly Asn Gly 
625 630 635 640 



Lys Phe Ser Leu Ser Gly Gly Asn Trp Asp Asn lie Ser Asp Gly Ala 
645 650 655 



660 665 670 



680 685 



695 



Lys Gly 


Ala 


Gin Pro 


Val 




675 


Ser Met 


Lys 


690 




<210> 


3 


<211> 


2235 


<212> 


DNA 


<213> 


Homo 


<220> 




<221> 


CDS 


<222> 


(1) . 


<223> 




<400> 


3 


atg eta 


cca 


Met Leu 


Pro 


1 





5 10 15 

gtg ttc age ggc ggc ggc gcg age age ggc gag gta aat ggt ctt aaa 
Val Phe Ser Gly Gly Gly Ala Ser Ser Gly Glu Val Asn Gly Leu Lys 
20 25 30 

atg gtt gat gag cca atg gaa gag gga gaa gca gat tct tgt cat gat 
Met Val Asp Glu Pro Met Glu Glu Gly Glu Ala Asp Ser Cys His Asp 
35 40 45 

gaa gga gtt gtt aaa gaa ate cct att act cat cat gtt aag gaa ggc 
Glu Gly Val Val Lys Glu lie Pro lie Thr His His Val Lys Glu Gly 
50 55 60 

tat gag aaa gca gat cct gca cag ttt gag ttg etc aag gtt ctt ggt 
Tyr Glu Lys Ala Asp Pro Ala Gin Phe Glu Leu Leu Lys Val Leu Gly 
65 ^ 70 75 80 
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cag ggg tea ttt gga aag gtt ttt ctt gtt aga aag aag acc ggt cct 288 

Gin Gly Ser Phe Gly Lys Val Phe Leu Val Arg Lys Lys Thr Gly Pro , 
85 " 90 95 

gat get ggg cag etc tat gca atg aag gtg tta aaa aaa gee tct tta 336 

Asp Ala Gly Gin Leu Tyr Ala Met Lys Val Leu Lys Lys Ala Ser Leu 
100 " 105 110 

aaa gtt cga gac aga gtt egg aca aag atg gag agg gat ata ctg gtg 384 

Lys Val Arg Asp Arg Val Arg Thr Lys Met Glu Arg Asp lie Leu Val 
115 120 125 

gaa gta aat cat cca ttt att gtc aaa ttg cac tat gec ttt cag act 432 

Glu Val Asn His Pro Phe lie Val Lys Leu His Tyr Ala Phe Gin Thr 
130 135 140 

gaa ggg aaa ctg tac tta ata ctg gat ttt etc agg gga gga gat gtt 480 

Glu Gly Lys Leu Tyr Leu lie Leu Asp Phe Leu Arg Gly Gly Asp Val 

145 " 150 155 160 

ttc aca aga tta tec aaa gag gtt ctg ttt aca gag gaa gat gtg aaa 528 

Phe Thr Arg Leu Ser Lys Glu Val Leu Phe Thr Glu Glu Asp Val Lys 
165 170 175 

ttc tac etc gca gaa ctg gec ctt get ttg gat cat ctg cac caa tta 576 

Phe Tyr Leu Ala Glu Leu Ala Leu Ala Leu Asp His Leu His Gin Leu 
180 185 190 

i 

gga att gtt tat aga gac ctg aag cca gaa aac att ttg ctt gat gaa 624 

Gly lie Val Tyr Arg Asp Leu Lys Pro Glu Asn lie Leu Leu Asp Glu 
195 200 205 

ata gga cat ate aaa tta aca gat ttt gga etc age aag gag tea gta 672 

lie Gly His lie Lys Leu Thr Asp Phe Gly Leu Ser Lys Glu Ser Val 
210 215 220 

gat caa gaa aag aag get tac tea ttt tgt ggt aca gta gag tat atg 720 

Asp Gin Glu Lys Lys Ala Tyr Ser Phe Cys Gly Thr Val Glu Tyr Met 

225 230 235 240 

get cct gaa gta gta aat agg aga ggc cat tec cag agt get gat tgg 7 68 

Ala Pro Glu Val Val Asn Arg Arg Gly His Ser Gin Ser Ala Asp Trp 
245 250 255 

tgg tea tat ggt gtt ctt atg ttt gaa atg ctt act ggt act ctg cca 816 

Trp Ser Tyr Gly Val Leu Met Phe Glu Met Leu Thr Gly Thr Leu Pro 
260 265 270 

ttt caa ggt aaa gac aga aat gag acc atg aat atg ata tta aaa gca 864 

Phe Gin Gly Lys Asp Arg Asn Glu Thr Met Asn Met lie Leu Lys Ala 
275 280 285 

aaa ctt gga atg cct caa ttt ctt agt get gaa gca caa agt ctt eta -912 

Lys Leu Gly Met Pro Gin Phe Leu Ser Ala Glu Ala Gin Ser Leu Leu 
290 295 300 

agg atg tta ttc aaa agg aat cca gca aat aga ttg gga tea gaa gga 960 

Arg Met Leu Phe Lys Arg Asn Pro Ala Asn Arg Leu Gly Ser Glu Gly 

305 310 315 320 

gtt gaa gaa ate aaa aga cat ctg ttt ttt gca aat att gac tgg gat 1008 

8 



-30- 



Val Glu Glu 



aaa tta tat 
Lys Leu Tyr 



aaa cca gat 
Lys Pro Asp 
355 

cct aaa gat 
Pro Lys Asp 
370 

ttc aaa gga 
Phe Lys Gly 
385 

ate act cct 
lie Thr Pro 



gga aat get 
Gly Asn Ala 



ggt gtt ggc 
Gly Val Gly 
435 

aac atg gaa 
Asn Met Glu 
450 

tea gaa gag 
Ser Glu Glu 
465 

att act ttg 
lie Thr Leu 



acg gat tta 
Thr Asp Leu 



aaa tgt ttc 
Lys Cys Phe 
515 

aag aca gtt 
Lys Thr Val 
530 

aaa cct agt 
Lys Pro Ser 
545 

ate agg ata 
lie Arg lie 



lie Lys Arg 
325 

aaa aga gaa 
Lys Arg Glu 
340 

gat act ttt 
Asp Thr Phe 



tct ccc ggt 
Ser Pro Gly 



ttc age ttt 
Phe Ser Phe 
390 

ate aca agt 
lie Thr Ser 
405 

gca caa ttt 
Ala Gin Phe 
420 

tec tac tct 
Ser Tyr Ser 



ttt gca gtg 
Phe Ala Val 



att gaa ata 
lie Glu lie 
470 

aag gat gtc 
Lys Asp Val 
485 

atg aaa gga 
Met Lys Gly 
500 

teg gaa egg 
Ser Glu Arg 



gac tat ctt 
Asp Tyr Leu 



aat att tta 
Asn lie Leu 
550 

tgt gat ttt 
Cys Asp Phe 



His Leu Phe 



gtt caa cct 
Val Gin Pro 
345 

tgt ttt gat 
Cys Phe Asp 
360 

ttg cca gee 
Leu Pro Ala 
375 

gtt gca act 
Val Ala Thr 



gca aat gta 
Ala Asn Val 



ggt gaa gta 
Gly Glu Val 
425 

gtt tgc aag 
Val Cys Lys 
440 

aag ate att 
Lys lie lie 
455 

ttg atg cgc 
Leu Met Arg 



ttt gat gat 
Phe Asp Asp 



gga gag tta 
Gly Glu Leu 
505 

gag get agt 
Glu Ala Ser 
520 

cat tgt caa 
His Cys Gin 
535 

tac atg gat 
Tyr Met Asp 



ggg ttt gca 
Gly Phe Ala 



Phe Ala Asn 
330 

cct ttc aaa 
Pro Phe Lys 



cct gaa ttt 
Pro Glu Phe 



agt gca aat 
Ser Ala Asn 
380 

tct att gca 
Ser lie Ala 
395 

tta cca att 
Leu Pro lie 
410 

tat gaa ttg 
Tyr Glu Leu 



cga tgc ata 
Arg Cys lie 



gac aaa agt 
Asp Lys Ser 
460 

tat gga caa 
Tyr Gly Gin 
475 

ggt aga tat 
Gly Arg Tyr 
490 

ctt gac cgt 
Leu Asp Arg 



gat ata eta 
Asp lie Leu 



gga gtt gtt 
Gly Val Val 
540 

gaa tea gee 
Glu Ser Ala 
555 

aaa caa ctt 
Lys Gin Leu 



lie Asp Trp 
335 

cct get tct 
Pro Ala Ser 
350 

act gca aaa 
Thr Ala Lys 
365 

get cat cag 
Ala His Gin 



gaa gaa tat 
Glu Glu Tyr 



gtt cag ata 
Val Gin lie 
415 

aag gag gat 
Lys Glu Asp 
430 

cat gca act 
His Ala Thr 
445 

aag cga gac 
Lys Arg Asp 



cat ccc aac 
His Pro Asn 



gtt tac ctt 
Val Tyr Leu 
495 

att etc aaa 
lie Leu Lys 
510 

tat gta ata 
Tyr Val lie 
525 

cat cgt gat 
His Arg Asp 



agt gca gat 
Ser Ala Asp 



cga gga gaa 
Arg Gly Glu 



Asp 



gga 1056 
Gly 



aca 1104 
Thr 



etc 1152 
Leu 



aaa ' 1200 

Lys 

400 

aat 1248 
Asn 



att 1296 
He 



ace 1344 
Thr 



cct 1392 
Pro 



att 1440 

He 

480. 

gtt 1488 
Val 



caa 1536 
Gin 



agt 1584 
Ser 



ctt 1632 
Leu 



tea 1680 

Ser 

560 

aat 1728 
Asn 
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565 570 575 

gga ctt etc tta act cca tgc tac act gca aac ttt gtt gca cct gag 1776 
Gly Leu Leu Leu Thr Pro Cys Tyr Thr Ala Asn Phe Val Ala Pro Glu 
580 585 590 

gtt ctt atg caa cag gga tat gat get get tgt gat ate tgg agt tta 1824 
Val Leu Met Gin Gin Gly Tyr Asp Ala Ala Cys Asp lie Trp Ser Leu 
595 600 605 

gga gtc ctt ttt tac aca atg ttg get ggc tac act cca ttt get aat 1872 
Gly Val Leu Phe Tyr Thr Met Leu Ala Gly Tyr Thr Pro Phe Ala Asn 
610 615 620 

ggc ccc aat gat act cct gaa gag ata ctg ctg cgt ata ggc aat gga 1920 
Gly Pro Asn Asp Thr Pro Glu Glu lie Leu Leu Arg lie Gly Asn Gly 
625 630 635 640 

aaa ttc tct ttg agt ggt gga aac tgg gac aat att tea gac gga gca 1968 
Lys Phe Ser Leu Ser Gly Gly Asn Trp Asp Asn lie Ser Asp Gly Ala 
645 650 655 

aag gat ttg ctt tec cat atg ctt cat atg gac cca cat cag egg tat 2016 
Lys Asp Leu Leu Ser His Met Leu His Met Asp Pro His Gin Arg Tyr 
660 665 670 

act get gaa caa ata tta aag cac tea tgg ata act cac aga gac cag 2064 
Thr Ala Glu Gin lie Leu Lys His Ser Trp lie Thr His Arg Asp Gin 
675 680 685 

ttg cca aat gat cag cca aag aga aat gat gtg tea cat gtt gtt aag 2112 
Leu Pro Asn Asp Gin Pro Lys Arg Asn Asp Val Ser His Val Val Lys 
690 695 700 

gga gca atg gtt gca aca tac tct gee ctg act cac aag ace ttt caa 2160 
Gly Ala Met Val Ala Thr Tyr Ser Ala Leu Thr His Lys Thr Phe Gin 
705 710 715 720 

cca gtc eta gag cct gta get get tea age tta gee cag cga egg age 2208 
Pro Val Leu Glu Pro Val Ala Ala Ser Ser Leu Ala Gin Arg Arg Ser 
725 730 735 

atg aaa aag cga aca tea act ggc ctg 2235 
Met Lys Lys Arg Thr Ser Thr Gly Leu 
740 " 745 

<210> 4 

<211> 745 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Leu Pro Phe Ala Pro Gin Asp Glu Pro Trp Asp Arg Glu Met Glu 
15 10 15 

Val Phe Ser Gly Gly Gly Ala Ser Ser Gly Glu Val Asn Gly Leu Lys 
20 25 30 
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Met Val Asp Glu Pro Met Glu Glu Gly Glu Ala Asp Ser Cys His Asp 
35 40 45 



Glu Gly Val Val Lys Glu He Pro He Thr His His Val Lys Glu Gly 
50 55 60 



Tyr Glu Lys Ala Asp Pro Ala Gin Phe Glu Leu Leu Lys Val Leu Gly 
65 70 75 80 



Gin Gly Ser Phe Gly Lys Val Phe Leu Val Arg Lys Lys Thr Gly Pro 
85 90 95 



Asp Ala Gly Gin Leu Tyr Ala Met Lys Val Leu Lys Lys Ala Ser Leu 
100 " 105 110 



Lys Val Arg Asp Arg Val Arg Thr Lys Met Glu Arg Asp He Leu Val 
115 120 125 



Glu Val Asn His Pro Phe He Val Lys Leu His Tyr Ala Phe Gin Thr 
130 135 140 



Glu Gly Lys Leu Tyr Leu He Leu Asp Phe Leu Arg Gly Gly Asp Val 
145 150 155 160 



Phe Thr Arg Leu Ser Lys Glu Val Leu Phe Thr Glu Glu Asp Val Lys 
165 170 175 



Phe Tyr Leu Ala Glu Leu Ala Leu Ala Leu Asp His Leu His Gin Leu 
180 185 190 



Gly He Val Tyr Arg Asp Leu Lys Pro Glu Asn He Leu Leu Asp Glu 
195 200 205 



He Gly His He Lys Leu Thr Asp Phe Gly Leu Ser Lys Glu Ser Val 
210 215 220 



Asp Gin Glu Lys Lys Ala Tyr Ser Phe Cys Gly Thr Val Glu Tyr Met 
225 230 235 240 



Ala Pro Glu Val Val Asn Arg Arg Gly His Ser Gin Ser Ala Asp Trp 
245 250 255 



Trp Ser Tyr Gly Val Leu Met Phe Glu Met Leu Thr Gly Thr Leu Pro 
260 265 270 
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Phe Gin Gly Lys Asp Arg Asn Glu Thr Met Asn Met lie Leu Lys Ala 
275 280 285 



Lys Leu Gly Met Pro Gin Phe Leu Ser Ala Glu Ala Gin Ser Leu Leu 
290 295 300 



Arg Met Leu Phe Lys Arg Asn Pro Ala Asn Arg Leu Gly Ser Glu Gly 
305 ' 310 315 ~ 320 



Val Glu Glu lie Lys Arg His Leu Phe Phe Ala Asn lie Asp Trp Asp 
325 330 335 



Lys Leu Tyr Lys Arg Glu Val Gin Pro Pro Phe Lys Pro Ala Ser Gly 
340 345 350 



Lys Pro Asp Asp Thr Phe Cys Phe Asp Pro Glu Phe Thr Ala Lys Thr 
355 360 365 



Pro Lys Asp Ser Pro Gly Leu Pro Ala Ser Ala Asn Ala His Gin Leu 
370 375 380 



Phe Lys Gly Phe Ser Phe Val Ala Thr Ser lie Ala Glu Glu Tyr Lys 
385 390 395 ' 400 



lie Thr Pro lie Thr Ser Ala Asn Val Leu Pro lie Val Gin lie Asn 
405 410 415 



Gly Asn Ala Ala Gin Phe Gly Glu Val Tyr Glu Leu Lys Glu Asp lie 
420 425 430 



Gly Val Gly Ser Tyr Ser Val Cys Lys Arg Cys lie His Ala Thr Thr 
435 440 445 



Asn Met Glu Phe Ala Val Lys lie lie Asp Lys Ser Lys Arg Asp Pro 
450 455 ~ ^ 460 



Ser Glu Glu lie Glu lie Leu Met Arg Tyr Gly Gin His Pro Asn lie 
465 470 ~ " 475 480. 



lie Thr Leu Lys Asp Val Phe Asp Asp Gly Arg Tyr Val Tyr Leu Val 
485 490 495 



Thr Asp Leu Met Lys Gly Gly Glu Leu Leu Asp Arg lie Leu Lys Gin 
500 505 510 



Lys Cys Phe Ser Glu Arg Glu Ala Ser Asp lie Leu Tyr Val lie Ser 
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515 520 525 



Lys Thr Val Asp Tyr Leu His Cys Gin Gly Val Val His Arg Asp Leu 
530 * " 535 540 



Lys Pro Ser Asn lie Leu Tyr Met Asp Glu Ser Ala Ser Ala Asp Ser 
545 550 555 560 



lie Arg lie Cys Asp Phe Gly Phe Ala Lys Gin Leu Arg Gly Glu Asn 
565 570 575 



Gly Leu Leu Leu Thr Pro Cys Tyr Thr Ala Asn Phe Val Ala Pro Glu 
580 ' 585 590 



Val Leu Met Gin Gin Gly Tyr Asp Ala Ala Cys Asp lie Trp Ser Leu 
595 600 605 



Gly Val Leu Phe Tyr Thr Met Leu Ala Gly Tyr Thr Pro Phe Ala Asn 
610 615 620 



Gly Pro Asn Asp Thr Pro Glu Glu lie Leu Leu Arg lie Gly Asn Gly 
625 630 635 640 



Lys Phe Ser Leu Ser Gly Gly Asn Trp Asp Asn lie Ser Asp Gly Ala 
645 650 655 



Lys Asp Leu Leu Ser His Met Leu His Met Asp Pro His Gin Arg Tyr 
660 665 670 



Thr Ala Glu Gin lie Leu Lys His Ser Trp lie Thr His Arg Asp Gin 
675 680 685 



Leu Pro Asn Asp Gin Pro Lys Arg Asn Asp Val Ser His Val Val Lys 
690 695 700 



Gly Ala Met Val Ala Thr Tyr Ser Ala Leu Thr His Lys Thr Phe Gin 
705 710 715 720 



Pro Val Leu Glu Pro Val Ala Ala Ser Ser Leu Ala Gin Arg Arg Ser 
725 730 735 



Met Lys Lys Arg Thr Ser Thr Gly Leu 
740 745 



<210> 5 
<211> 22 



13 



-35- 



<212> 
<213> 



DNA 

Homo sapiens 



<400> 5 

ggagcaaagg gagcaatggt tg 



22 



<210> 6 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 6 

tcttcatcca gtttggccta gg 22 



<210> 7 

<211> 20 

<212> DNA 

<213> Homo sapiens 



<210> 8 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 8 

gtgtcgctgt tgaagtcaga 20 



<400> 7 

tgggtgtgaa ccatgagaag 



20 
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